The present invention relates to a method of generating, in a media system, at least one
output signal from at least one input signal from a second set of sound signals
having a related second set of Head Related Transfer Functions.

The present invention also relates to a computer system for performing the method.

The present invention further relates to a computer program product for 
performing the method. 

This invention further relates to a media system for generating at least one 
output signal from a first set of sound signals from at least one input signal from a second set 
of sound signals having a related second set of Head Related Transfer Functions. 



WO 01/49073 discloses a sound reproduction system simulating external
sound sources. The system uses a number of so-called Head Related Transfer Functions
(HRTFs) to generate sound for a set of headphones.

It is generally known in the prior art literature that input channels of sound sources
which are to be combined into outputs, i.e. resulting sound signals, require a relatively
high number of HRTFs. This typically leads to system implementations which are quite
expensive, require unnecessary convolutions and are complex to design. This will be
discussed further by means of figures 1 and 2, where prior art applications and the
invention are shown with the corresponding formulas and the calculated numbers of HRTFs.

The above problems are solved by said method, the method comprising the steps of:

• determining, for each signal in the second set of sound signals, a weighted relation
comprising at least one signal from a third set of intermediate sound signals and at least
one weight value;

• determining a first set of Head Related Transfer Functions based on the second set of
sound signals, the second set of Head Related Transfer Functions and the weighted
relation; and

• transferring at least one signal from the third set of intermediate sound signals by means
of at least one HRTF from said first set of Head Related Transfer Functions in order to
generate at least one output signal belonging to said first set of sound signals.

In the first step, for each signal in the second set of sound signals, i.e. for each
signal in a number of input sound signals, a weighted relation comprising intermediate
sound signals and at least one weight value is determined. Hereby said input sound signals
are converted to intermediate sound signals for subsequent internal use.

In the second step, said first, new set of HRTFs is then determined based
on the second set of sound signals, typically input sound signals, and on said second set of
Head Related Transfer Functions, which is related to said input sound signals and initially
dedicated to transform or transfer said second set of input sound signals.

It is an advantage that in said determination, which will be discussed in the
embodiments according to the invention, the new set of HRTFs comprises fewer HRTFs
than said second set of Head Related Transfer Functions originally dedicated to transfer the
input sound signals.

Subsequently, in the third step, said new and fewer HRTFs (i.e. the first set of
Head Related Transfer Functions) are used to generate one or more output signals (belonging
to said first set of sound signals): one or more signals from the third set of intermediate
sound signals are transferred by means of said new, lower number of HRTFs in order to obtain
said output signals.

Said problems are further solved by said media system on which said method
can be executed. The media system may be a TV, a CD player, a DVD player, a Radio, a
display with sound, an amplifier, a headphone or a VCR.

In a preferred embodiment, said media system comprises:

• means for determining, for each signal in the second set of sound signals, a weighted
relation comprising at least one signal from a third set of intermediate sound signals and
at least one weight value;

• means for determining a first set of Head Related Transfer Functions based on the second
set of sound signals, the second set of Head Related Transfer Functions and the weighted
relation; and

• means for transferring at least one signal from the third set of intermediate sound signals
by means of at least one HRTF from said first set of Head Related Transfer Functions in
order to generate at least one output signal belonging to said first set of sound signals.

The media system gives the same advantages for the same reasons as
described previously in relation to the method.



The prior art and the invention will be explained more fully below in 
connection with preferred embodiments and with reference to the drawings, in which: 

Fig. 1 shows examples of the generation of two output sound signals from
three input sound signals in the prior art and according to the invention;

Fig. 2 shows the generation of two output sound signals from one input sound
signal; and

Fig. 3 shows a method of generating at least one output sound signal from at
least one input sound signal from a second set of input sound signals having a related second
set of Head Related Transfer Functions.

Throughout the drawings, the same reference numerals and like names 
indicate similar or corresponding features, functions, etc. 



In the present invention a set of Head Related Transfer Functions (HRTFs) may
be used to generate one or more sound signals. The HRTFs may be defined as functions
describing how sound propagates from a specific sound source to the ear. The number of
HRTFs belonging to a set may range from one HRTF, describing sound propagation from a
source to the two ears, to a number of HRTFs depending on the number of sources
delivering sound. Alternatively, from a few (n) input signals, m intermediate signals
(m > n) may be derived, which need 2 times m HRTFs; that is, HRTFs may
be used to expand said input signals (as the source) into multi-channel sound (as an
intermediate product), which may then be down-mixed to fewer resulting output sound signals,
e.g. a Left and a Right signal for a headphone.

A description of HRTFs can further be found in the following journal article: H.
Møller, "Fundamentals of binaural technology", Applied Acoustics, Special issue on auditory
environment and telepresence, Vol. 36, No. 3-4, pp. 171-218, (1992).

In the following, the HRTF is defined in further detail. To find the sound
pressure that an arbitrary source produces at the eardrum (taking into consideration
parameters such as the distance between the ears and the shape of the outer ear), all that is
needed is the impulse response from the source to the eardrum, which can be measured e.g.
by placing a microphone in the ear. This is called the Head-Related Impulse Response, and
its Fourier transform is called the Head Related Transfer Function (HRTF). Once the HRTFs
for the left ear and the
right ear are known, it is possible to synthesize accurate binaural signals from a monaural
source. The head related transfer function is well known and is described in a number of 
documents, such as Blauert, Spatial hearing: The Psychophysics of Human Sound 
Localization (MIT Press, Cambridge, MA, 1983). When sound is filtered by a set of HRTFs 
the sound is optimised for the person to which the set of HRTFs belongs and therefore the 
sound experience is never optimal for anyone but the person to which the set of HRTFs 
belongs. The set of HRTFs are filter functions with parameters or coefficients being specific 
for specific persons. For a specific person different sets of HRTFs can be obtained depending 
on the arbitrary source mentioned above, the distance between the source and the person and 
also on the characteristics of the room in which the function parameters are measured. When 
e.g. the source is headphones, the HRTFs depend on the headphone through which sound 
reproduction takes place. The result of filtering sound using this function is that an optimal 
spatial reproduction of surround sound in headphones is obtained. The source could also be a 
typical loudspeaker; in this case it is necessary to perform cross-talk cancellation, which e.g. 
can be based on the HRTF. 

Stereophonic sound signals comprise a left and a right signal component 
which may originate from a stereo signal source, for example from a set of microphones, e.g. 
via further electronic equipment, such as a mixing equipment, etc. The signals may further be 
received as an output from another stereo player, over-the-air as a radio signal, or by any 
other suitable means. 

Fig. 1 shows examples of the generation of two output sound signals from 
three input sound signals in the prior art and according to the invention. Said two sound 
signals may in a typical use comprise a stereophonic signal distributed to two speakers in a 
headphone. 

Firstly, according to the prior art, it is well known to reproduce multi-channel
sound via headphones. This multi-channel sound reproduction through a headphone makes
use of the known techniques called binaural technology and Head Related Transfer Functions
(HRTFs).

The term "binaural" refers to the fact that there are two inputs to the listener's ears (left and
right). Any set of left and right channel signals that are recorded at the position of the
eardrum are called binaural signals.

It is the intention to have the same sound at the eardrum when using a
headphone as when loudspeakers are playing. In order to achieve this, more knowledge must
be gathered about the transmission of sound from the sound source to the eardrum. This
transmission is best described in terms of Head Related Transfer Functions (HRTFs), which
include any linear filtering, such as coloration and inter-aural time and spectral
differences. Inter-aural time differences occur because a sound wave travels two different
distances to the left and the right ear.
These transfer functions depend on the angle of incidence and the distance to the sound source.

Returning to the figure, reference numerals 1, 2 and 3 indicate the
three channels (i.e. three input sound signals) CH_1, CH_2 and CH_3, which are combined
into a left (H_PL) and a right (H_PR) resulting (output) sound signal for the headphone. Said
channels are transmitted by means of the related Head Related Transfer Functions, three for
each output signal, reference numerals 4 through 9. In other words, CH_1 is transmitted by
means of the Head Related Transfer Function HRTF_1, correspondingly CH_2 is transmitted by
means of the Head Related Transfer Function HRTF_2, etc. This is performed for both output
channels so that the stereophonic signals are generated by summation of the products of
channels and related HRTFs, reference numerals 10 and 11. Said stereophonic (output) signals
are indicated by left, H_PL, reference numeral 12, and right, H_PR, reference numeral 13, as
the two resulting sound signals.

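The summation for the left resulting sound signal is then:

\[ H_{PL} = \mathrm{CH}_1 \bullet \mathrm{HRTF}_{1,L} + \mathrm{CH}_2 \bullet \mathrm{HRTF}_{2,L} + \mathrm{CH}_3 \bullet \mathrm{HRTF}_{3,L} \tag{1} \]

Correspondingly, the summation for the right resulting sound signal will then be:

\[ H_{PR} = \mathrm{CH}_1 \bullet \mathrm{HRTF}_{1,R} + \mathrm{CH}_2 \bullet \mathrm{HRTF}_{2,R} + \mathrm{CH}_3 \bullet \mathrm{HRTF}_{3,R} \tag{2} \]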

Thus in the prior art case, this transmission will require two times three, i.e. 
six Head Related Transfer Functions. 

Generally throughout the application, the notation " • " denotes a product if the 
above-mentioned variables are in the frequency domain; whereas in the time domain, " • " 
denotes a convolution of the variables. 
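
The equivalence of the two readings of " • " can be checked numerically. The following
Python sketch is an illustration only: the sample values and the helper names (convolve,
dft, idft) are assumptions and not part of the application. It convolves a channel with an
impulse response in the time domain and compares the result with the inverse transform of
the bin-by-bin product of the two zero-padded spectra.

```python
import cmath


def convolve(x, h):
    """Full linear convolution of two finite sample sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def dft(x, size):
    """Naive discrete Fourier transform of x, zero-padded to `size` points."""
    padded = list(x) + [0.0] * (size - len(x))
    return [sum(padded[t] * cmath.exp(-2j * cmath.pi * k * t / size)
                for t in range(size))
            for k in range(size)]


def idft(spectrum):
    """Naive inverse transform; returns the real part of every sample."""
    size = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / size)
                for k in range(size)).real / size
            for t in range(size)]


# Hypothetical sample data (not taken from the application).
channel = [1.0, 0.5, -0.25, 0.0]   # a few samples of one channel
hrir = [0.5, 0.25, 0.125]          # a short head-related impulse response

# Time domain: " • " is a convolution.
time_domain = convolve(channel, hrir)

# Frequency domain: " • " is a product of the spectra, bin by bin.
size = len(channel) + len(hrir) - 1
product = [a * b for a, b in zip(dft(channel, size), dft(hrir, size))]
freq_domain = idft(product)

max_error = max(abs(a - b) for a, b in zip(time_domain, freq_domain))
```

Zero-padding both sequences to the length of the full convolution is what makes the
product of the spectra correspond to a linear rather than a circular convolution.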



PHNL020901EPP 

W 

6 17.09.2002 
Generally and correspondingly, when expanding the prior art example, n (here n = 3)
input channels of sound sources (CH_1 to CH_3) to be combined into m sound outputs, i.e. m
resulting sound signals, will require n times m Head Related Transfer Functions.
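
The prior art summation of formulas (1) and (2) may be sketched as follows. This is a
minimal Python illustration under stated assumptions: the function names (convolve,
mix_prior_art), the integer sample values and the two-tap impulse responses are
hypothetical, and the time-domain reading of " • " (convolution) is used. The counter
shows that n channels and m outputs use n times m filters.

```python
def convolve(x, h):
    """Full linear convolution of two finite sample sequences."""
    y = [0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def mix_prior_art(channels, hrirs):
    """Sum channel-by-filter convolutions for every output signal.

    channels: n equal-length sample lists (CH_1 .. CH_n).
    hrirs:    hrirs[k][i] is the impulse response from channel i to output k.
    Returns (outputs, number_of_filters_used).
    """
    outputs = []
    used = 0
    for per_output in hrirs:
        total = None
        for channel, hrir in zip(channels, per_output):
            part = convolve(channel, hrir)
            used += 1
            total = part if total is None else [a + b for a, b in zip(total, part)]
        outputs.append(total)
    return outputs, used


# Hypothetical data: three input channels, two outputs (left, right).
channels = [[1, 0, 2], [0, 1, 1], [3, 0, 0]]
hrirs = [
    [[1, 1], [2, 0], [0, 1]],   # HRTF_{1,L}, HRTF_{2,L}, HRTF_{3,L}
    [[1, 0], [0, 1], [1, 1]],   # HRTF_{1,R}, HRTF_{2,R}, HRTF_{3,R}
]

(h_pl, h_pr), filters_used = mix_prior_art(channels, hrirs)
```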

Secondly, according to a preferred embodiment of the invention, the same
transmission as in the prior art example may be implemented in a different way. In order to
continue the example, the same three channels (CH_1, CH_2 and CH_3) will be discussed. It is
assumed that these may be linear combinations of a weighted version of the left and right
(intermediate) channel with the weights α and β. Said α and β may have their weight values
depending on each channel, i.e. L and R, thus in general:

\[ \mathrm{CH}_i = \alpha_i \bullet L + \beta_i \bullet R \tag{3} \]



Someone skilled in the art may, when applying the invention for more than
two channels (L, R), e.g. for a third channel, a fourth channel, etc., i.e. C, D, etc.,
subsequently generalize formula (3) into:

\[ \mathrm{CH}_i = \alpha_i \bullet L + \beta_i \bullet R + c_i \bullet C + d_i \bullet D + \ldots \]

for a correspondingly higher number of resulting (output) sound signals (H_PL, H_PR, H_PC,
H_PD, etc.) for corresponding speakers or end result sounds.

In the Audio Engineering Society Conference Paper presented at the 19th
International Conference, 2001 June 21 - 24, Schloss Elmau, Germany, by Roy Irwan and
Ronald M. Aarts, Philips Research Laboratories, a method to convert stereo to multi-channel
sound is disclosed. In this paper, on page 3, said α's and β's are defined using a
corresponding w_L(k) and w_R(k) (weight) notation, at the time instant k, for the left and
right channel, respectively.

For the sake of conciseness, only two channels (of resulting (output) sound signals)
will be used in this example.

Continuing the prior art example of figure 1, but implemented in a preferred 
first embodiment of the invention according to the following: 



\[ \mathrm{CH}_1 = \alpha_1 \bullet L + \beta_1 \bullet R \tag{4} \]

\[ \mathrm{CH}_2 = \alpha_2 \bullet L + \beta_2 \bullet R \tag{5} \]

\[ \mathrm{CH}_3 = \alpha_3 \bullet L + \beta_3 \bullet R \tag{6} \]

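It is found that formulas (1) and (2) may still be applied for the summation (of
products of channels and related HRTFs); thus, when (4), (5) and (6) are inserted in (1) and
(2), it gives:

\[ H_{PL} = (\alpha_1 \bullet L + \beta_1 \bullet R) \bullet \mathrm{HRTF}_{1,L} + (\alpha_2 \bullet L + \beta_2 \bullet R) \bullet \mathrm{HRTF}_{2,L} + (\alpha_3 \bullet L + \beta_3 \bullet R) \bullet \mathrm{HRTF}_{3,L} \tag{7} \]

\[ H_{PR} = (\alpha_1 \bullet L + \beta_1 \bullet R) \bullet \mathrm{HRTF}_{1,R} + (\alpha_2 \bullet L + \beta_2 \bullet R) \bullet \mathrm{HRTF}_{2,R} + (\alpha_3 \bullet L + \beta_3 \bullet R) \bullet \mathrm{HRTF}_{3,R} \tag{8} \]

Or expressed differently:

\[ H_{PL} = L \bullet (\alpha_1 \bullet \mathrm{HRTF}_{1,L} + \alpha_2 \bullet \mathrm{HRTF}_{2,L} + \alpha_3 \bullet \mathrm{HRTF}_{3,L}) + R \bullet (\beta_1 \bullet \mathrm{HRTF}_{1,L} + \beta_2 \bullet \mathrm{HRTF}_{2,L} + \beta_3 \bullet \mathrm{HRTF}_{3,L}) \tag{9} \]

Accordingly,

\[ H_{PR} = L \bullet (\alpha_1 \bullet \mathrm{HRTF}_{1,R} + \alpha_2 \bullet \mathrm{HRTF}_{2,R} + \alpha_3 \bullet \mathrm{HRTF}_{3,R}) + R \bullet (\beta_1 \bullet \mathrm{HRTF}_{1,R} + \beta_2 \bullet \mathrm{HRTF}_{2,R} + \beta_3 \bullet \mathrm{HRTF}_{3,R}) \tag{10} \]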

Note, however, that the HRTFs discussed so far in respect of the invention are
merely used as intermediate variables in the formulas; as opposed to those in the discussion
relating to said prior art, they are not, and need not be, implemented as real Head Related
Transfer Functions.

Or, for i = 1, ..., 3, i.e. in a generalized form:

\[ H_{PL} = L \bullet \sum_i (\alpha_i \bullet \mathrm{HRTF}_{i,L}) + R \bullet \sum_i (\beta_i \bullet \mathrm{HRTF}_{i,L}) \tag{11} \]

\[ H_{PR} = L \bullet \sum_i (\alpha_i \bullet \mathrm{HRTF}_{i,R}) + R \bullet \sum_i (\beta_i \bullet \mathrm{HRTF}_{i,R}) \tag{12} \]

Thus only two filters are needed for the Left headphone driver, H_PL, in
order to filter the Left and Right signals respectively, since the factors Σ(α_i • HRTF_i,L)
and Σ(β_i • HRTF_i,L) in formula (11) are each considered as one filter.

Correspondingly, with regard to formula (12), Σ(α_i • HRTF_i,R) and Σ(β_i • HRTF_i,R)
are the two filters for the Right headphone driver, H_PR.

Thus only two filters are needed to filter the Left and Right signals for the
Right headphone driver.

Thus, when continuing the implementation according to the invention with
three input sound channels, the transmission will now only require two times two, i.e. four
Head Related Transfer Functions. Compared to the prior art example of figure 1, where six
Head Related Transfer Functions were required, the invention will require fewer Head
Related Transfer Functions for the same transmission.

Correspondingly, fewer convolutions will be required for the same transmission.
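
The reduction from six to four filters can be illustrated with a short Python sketch. It
is a sketch under stated assumptions, not the implementation of the application: all
names and sample values are hypothetical, integer samples are used so that the
comparison is exact, and " • " is read as a time-domain convolution. The three channels
are built from L and R as in formulas (4) to (6); the four combined filters follow
formulas (11) and (12).

```python
def convolve(x, h):
    """Full linear convolution of two finite sample sequences."""
    y = [0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def sum_signals(signals):
    """Sample-wise sum of equal-length signals."""
    return [sum(samples) for samples in zip(*signals)]


def weighted_sum(filters, weights):
    """Return sum_i weights[i] * filters[i] for equal-length filters."""
    return [sum(w * f[t] for w, f in zip(weights, filters))
            for t in range(len(filters[0]))]


# Hypothetical intermediate signals, weights and impulse responses.
L = [1, 0, 2]
R = [0, 1, 1]
alpha = [1, 2, 0]                       # alpha_1 .. alpha_3
beta = [0, 1, 3]                        # beta_1 .. beta_3
hrtf_left = [[1, 1], [2, 0], [0, 1]]    # HRTF_{i,L}
hrtf_right = [[1, 0], [0, 1], [1, 1]]   # HRTF_{i,R}

# Prior art, formulas (1) and (2): six convolutions on CH_1 .. CH_3.
channels = [[a * l + b * r for l, r in zip(L, R)] for a, b in zip(alpha, beta)]
prior_left = sum_signals([convolve(c, h) for c, h in zip(channels, hrtf_left)])
prior_right = sum_signals([convolve(c, h) for c, h in zip(channels, hrtf_right)])

# Invention, formulas (11) and (12): four combined filters, four convolutions.
filter_LL = weighted_sum(hrtf_left, alpha)    # sum_i alpha_i * HRTF_{i,L}
filter_RL = weighted_sum(hrtf_left, beta)     # sum_i beta_i  * HRTF_{i,L}
filter_LR = weighted_sum(hrtf_right, alpha)   # sum_i alpha_i * HRTF_{i,R}
filter_RR = weighted_sum(hrtf_right, beta)    # sum_i beta_i  * HRTF_{i,R}
new_left = sum_signals([convolve(L, filter_LL), convolve(R, filter_RL)])
new_right = sum_signals([convolve(L, filter_LR), convolve(R, filter_RR)])
```

Because convolution is linear, both routes give the same output samples; only the
number of filters applied to the audio differs.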

In other words, when the example is further generalized, starting with and
according to the prior art, in a simple cascading of sound signals, e.g. with m = 2 (i.e.
stereo, two output channels or signals, e.g. for two headphone drivers), n = 5 input channels
or sound signals (CH_1 to CH_5) will require a total of 2 times 5, i.e. 10, HRTFs in the
prior art, whereas only four Head Related Transfer Functions are still required for a similar
transmission according to the invention's first embodiment.
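
The arithmetic above can be written out as a small Python sketch. The function names
are hypothetical, and the assumption is that every input channel is expressed through
the same intermediate signals (two, L and R, in the first embodiment).

```python
def prior_art_hrtf_count(n_inputs, n_outputs):
    """One HRTF per (input channel, output signal) pair."""
    return n_inputs * n_outputs


def invention_hrtf_count(n_intermediate, n_outputs):
    """One combined filter per (intermediate signal, output signal) pair."""
    return n_intermediate * n_outputs


# Figure 1: three input channels, two outputs, intermediate signals L and R.
three_channels = (prior_art_hrtf_count(3, 2), invention_hrtf_count(2, 2))

# Generalized example: five input channels, two outputs, still L and R.
five_channels = (prior_art_hrtf_count(5, 2), invention_hrtf_count(2, 2))
```

In this sketch the count for the invention does not depend on the number of input
channels, which is why the saving grows with n.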

Fig. 2 shows the generation of two output sound signals from one input sound 
signal. Said two sound signals may in a typical use again comprise a stereophonic signal 
distributed to two speakers in a headphone, however in this example - as a second 
embodiment of the invention - only one source, M of an input sound signal is discussed. 

Firstly, the prior art will be discussed with a calculation of the HRTFs used.

The prior art is applied for only one input channel (as in this figure), i.e. an
input sound source M, which is then distributed to two resulting (output) sound signals H_PL
and H_PR. Compared to figure 1, in principle one channel less (i.e. CH_3) is used;
correspondingly, the summation for the left resulting (output) sound signal in the prior art is:

\[ H_{PL} = \mathrm{CH}_1 \bullet \mathrm{HRTF}_{L,l} + \mathrm{CH}_2 \bullet \mathrm{HRTF}_{R,l} \tag{13} \]

And, correspondingly, the summation for the right resulting (output) sound signal
will then be:

\[ H_{PR} = \mathrm{CH}_1 \bullet \mathrm{HRTF}_{L,r} + \mathrm{CH}_2 \bullet \mathrm{HRTF}_{R,r} \tag{14} \]

Here the first, upper-case index denotes the loudspeaker channel, L or R,
respectively, and the second, lower-case index denotes the ear: l for the left ear, r for the
right ear.

Thus in this prior art case, this transmission will require two times two, i.e. 
four Head Related Transfer Functions. 

Secondly, the second embodiment according to the invention, i.e. figure 2, will
be discussed.

Imagine that a (moving) singer "M" in a studio is recorded onto a CD with two
output sound channels, H_PL and H_PR.

By using Principal Component Analysis, the necessary alphas, i.e. the α_i's (as shown
below in formula (15)), may be recovered. Hence two channels are used to locate the
singer on the line between the loudspeakers. It may be the case that the alphas are
time-variant.

A general discussion of Principal Component Analysis can be found in
S. Haykin, Neural Networks, Prentice-Hall, N.J., 1999,
Second Edition, which is used in the previously mentioned article "A method to convert
stereo to multi-channel".
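
How such weights might be recovered can be sketched in Python for the simplest case.
The sketch is an assumption-laden illustration and not the procedure of the cited
article: it takes a single source that is pan-potted with constant, unit-norm weights,
builds the 2x2 correlation matrix of the left and right samples, and returns the
direction of its dominant eigenvector in closed form. The function name and the sample
data are hypothetical; a principal direction is only defined up to sign and scale.

```python
import math


def principal_direction(left, right):
    """Unit vector (w_left, w_right) along the dominant axis of the
    (left, right) sample cloud, from the 2x2 correlation matrix."""
    c_ll = sum(l * l for l in left)
    c_rr = sum(r * r for r in right)
    c_lr = sum(l * r for l, r in zip(left, right))
    # Closed-form angle of the dominant eigenvector of [[c_ll, c_lr], [c_lr, c_rr]].
    angle = 0.5 * math.atan2(2.0 * c_lr, c_ll - c_rr)
    return math.cos(angle), math.sin(angle)


# Hypothetical source and pan-pot weights (unit norm: 0.6**2 + 0.8**2 == 1).
singer = [1.0, -2.0, 0.5, 3.0]
true_weights = (0.6, 0.8)
left = [true_weights[0] * s for s in singer]
right = [true_weights[1] * s for s in singer]

estimated = principal_direction(left, right)
```

For time-variant weights, the same estimate could be repeated over short blocks of
samples; that extension is not shown here.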

The single sound (input) source M may be anywhere between two
loudspeakers. E.g. in a studio there is a singer M, pan-potted between both (or even more)
channels, so that the left intermediate channel (CHI_1) may be expressed as α_1 • M and the
right intermediate channel (CHI_2) may be expressed as α_2 • M, thus:

\[ \mathrm{CHI}_1 = \alpha_1 \bullet M \quad \text{and} \quad \mathrm{CHI}_2 = \alpha_2 \bullet M \tag{15} \]

Note, however, that said channels (CHI_1, CHI_2), in respect of the invention for this
particular embodiment, are merely used as intermediate channels (variables) in the formulas
and are not real channels, as opposed to the channels (i.e. CH_1, CH_2) in the discussion
relating to the prior art.

In other words, in respect of the invention, left and right (intermediate
channels) are mapped onto one channel M.

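So equations (13) and (14) can, switching from the prior art to another
embodiment of the invention according to figure 2, be expressed as:

\[ H_{PL} = \alpha_1 \bullet M \bullet \mathrm{HRTF}_{L,l} + \alpha_2 \bullet M \bullet \mathrm{HRTF}_{R,l} \tag{16} \]

\[ H_{PR} = \alpha_1 \bullet M \bullet \mathrm{HRTF}_{L,r} + \alpha_2 \bullet M \bullet \mathrm{HRTF}_{R,r} \tag{17} \]

or

\[ H_{PL} = M \bullet (\alpha_1 \bullet \mathrm{HRTF}_{L,l} + \alpha_2 \bullet \mathrm{HRTF}_{R,l}) \tag{18} \]

\[ H_{PR} = M \bullet (\alpha_1 \bullet \mathrm{HRTF}_{L,r} + \alpha_2 \bullet \mathrm{HRTF}_{R,r}) \tag{19} \]

or

\[ H_{PL} = M \bullet H_1 \tag{20} \]

\[ H_{PR} = M \bullet H_2 \tag{21} \]

where

\[ H_1 = \alpha_1 \bullet \mathrm{HRTF}_{L,l} + \alpha_2 \bullet \mathrm{HRTF}_{R,l} \tag{22} \]

and

\[ H_2 = \alpha_1 \bullet \mathrm{HRTF}_{L,r} + \alpha_2 \bullet \mathrm{HRTF}_{R,r} \tag{23} \]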



This shows that the invention needs only two convolutions or HRTFs, since
the factors (H_1, H_2) in formulas (20) and (21), respectively, are each considered as one
HRTF filter.

Thus the transmission will now only require two Head Related Transfer
Functions. Compared to the prior art, where four Head Related Transfer Functions were
required, the invention will require fewer Head Related Transfer Functions (and
correspondingly convolutions) for the same transmission from one (input) sound source, M.
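
Formulas (16) to (23) can be checked with a minimal Python sketch. The names and the
integer sample values are hypothetical and chosen so that the comparison is exact;
" • " is again read as a time-domain convolution. The prior art route filters the two
intermediate channels with four impulse responses, whereas the route of formulas (20)
and (21) filters M with the two combined filters H_1 and H_2.

```python
def convolve(x, h):
    """Full linear convolution of two finite sample sequences."""
    y = [0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def add(a, b):
    """Sample-wise sum of two equal-length signals."""
    return [x + y for x, y in zip(a, b)]


# Hypothetical source, weights and impulse responses.
M = [1, 2, 0, 1]
alpha_1, alpha_2 = 2, 3
hrtf_L_l, hrtf_R_l = [1, 1], [0, 2]   # loudspeaker L / R to the left ear
hrtf_L_r, hrtf_R_r = [2, 0], [1, 1]   # loudspeaker L / R to the right ear

# Formulas (13) to (15): four convolutions on the intermediate channels.
chi_1 = [alpha_1 * s for s in M]
chi_2 = [alpha_2 * s for s in M]
prior_left = add(convolve(chi_1, hrtf_L_l), convolve(chi_2, hrtf_R_l))
prior_right = add(convolve(chi_1, hrtf_L_r), convolve(chi_2, hrtf_R_r))

# Formulas (20) to (23): two combined filters, two convolutions.
H_1 = [alpha_1 * a + alpha_2 * b for a, b in zip(hrtf_L_l, hrtf_R_l)]
H_2 = [alpha_1 * a + alpha_2 * b for a, b in zip(hrtf_L_r, hrtf_R_r)]
new_left = convolve(M, H_1)
new_right = convolve(M, H_2)
```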

However, said second embodiment of mapping only two output channels onto
one channel is very simple; the second embodiment may be generalized to mapping of more
than two channels onto one (with corresponding α's) as discussed in:

The patent application WO0207481: Multi-channel stereo converter for
deriving a stereo surround and/or audio centre signal, Koninklijke Philips Electronics N.V.,
Inventor(s): Irwan, Roy; Aarts, Ronaldus M., Application No. EP01 07757, Filed
20010705, A2, Published 20020124, where two channels (L, R) are mapped onto one C, or
centre channel, using Principal Component Analysis, and in C. Faller and F. Baumgarte,
Binaural cue coding applied to stereo and multi-channel audio compression, Convention
paper 5574 (L-6) of the 112th AES Convention, Munich, Germany, Audio Eng. Soc., May
2002.

Someone skilled in the art may, when applying the invention according to the
two embodiments, combine these and consider them as general-purpose (HRTF) function blocks
with sound inputs and outputs. In other words, said embodiments may be applied to
cascade-couple sound signals: instead of H_PL and H_PR being output sound signals from
one function block, they may, by cascading, be input to another function block.

Generally, said formulas throughout the application may be implemented in a
media system, such as a TV, a CD player, a DVD player, a Radio, a display, an amplifier or a
VCR. This is shown by means of reference numeral 20 of figure 2. However, it may
alternatively or additionally be the case that said formulas are integrated into circuitry (or
software) suitable for the purpose, embedded in headphones with sufficient processing power.

Transmission from channels (input sound signals), i.e. the CH's and M, to other
intermediate sound channels and to resulting (output) sound signals or channels is drawn in
the figures by lines with arrows. These lines may indicate that transmission may take place
by means of circuitry suitable for enabling the communication of sound data, e.g. via a wired
or a wireless data link. Examples of such circuitry may be various transmitters, e.g. a
transmitter including a network interface, a network card, a radio transmitter, a transmitter
for other suitable electromagnetic signals, such as an LED for transmitting infrared light,
e.g. via an IrDA port, radio-based communications, e.g. via a Bluetooth transceiver, or the
like. Further examples of suitable transmitters include a cable modem, a telephone modem, an
Integrated Services Digital Network (ISDN) adapter, a Digital Subscriber Line (DSL)
adapter, a satellite transceiver, an Ethernet adapter, or the like. Correspondingly, a
communications channel may be any suitable wired or wireless data link, for example of a
packet-based communications network, such as the Internet or another TCP/IP network, a
short-range communications link, such as an infrared link, a Bluetooth connection or another
radio-based link.

Further examples of the communications channel include computer networks
and wireless telecommunications networks, such as a Cellular Digital Packet Data (CDPD)
network, a Global System for Mobile communications (GSM) network, a Code Division Multiple
Access (CDMA) network, a Time Division Multiple Access (TDMA) network, a General Packet
Radio Service (GPRS) network, a Third Generation network, such as a UMTS network, or the
like.

Fig. 3 shows a method of generating at least one output sound signal from at
least one input signal from a second set of input sound signals having a related second set of
Head Related Transfer Functions. Said generation may take place in a media system, such as
a TV, a CD player, a DVD player, a Radio, a display, an amplifier, a headphone or a VCR.

In a typical application of the method (or embedded in an apparatus such as
said media system), said output sound signal may belong to a first set of output sound signals,
e.g. one or more outputs such as H_PL or H_PR directed to headphones or other speakers.
Conversely, said second set of sound signals may be inputs such as CH_1, CH_2, ..., CH_n and M.
However, said (input) sound signals may, in a sound signal cascade chain with function
blocks of HRTFs, be considered as general-purpose sound signals, being inputs or outputs
depending on whether they enter (as input) or leave (as output) a block of cascade-coupled
sound signals. In other words, output sound signals from one function block may be input
(sound signals) to another function block and vice versa.

Said second set of Head Related Transfer Functions (related to said input
sound signals) may, from the discussed embodiments, comprise Head Related Transfer
Functions (such as HRTF_L,l, HRTF_R,l, HRTF_L,r, HRTF_R,r, HRTF_1,L, HRTF_2,L,
HRTF_3,L, HRTF_1,R, HRTF_2,R, etc.) initially dedicated to transform or transfer said second
set of input sound signals.

In step 90, the method in accordance with preferred embodiments of the 
invention is started. Variables, flags, buffers, etc., keeping track of HRTFs, input and 
intermediate sound channels, output sound channels, weights, etc, corresponding to the sound 
signals processed are set to default values. When the method is started a second time, only 
corrupted variables, flags, buffers, etc, are reset to default values. 

In step 100, continuing the method description, for each signal in the second
set of (input) sound signals, a weighted relation may be determined. Said weighted relation
may comprise at least one signal from a third set of intermediate sound signals, such as L and
R, or CHI_1 and CHI_2, respectively (according to the two embodiments discussed), with
corresponding weight values.

As discussed in the embodiments of the invention, one example, the first
embodiment, may be CH_i = α_i • L + β_i • R (for each of the i input sound signals), wherein
α_i and β_i are weight values, and L and R each is a signal from said third set of
intermediate sound signals.

In the first embodiment, more input sound signals than (generated) output
sound signals are processed by means of fewer HRTFs as compared to the prior art.

As further discussed in the embodiments of the invention, another example,
the second embodiment, may be CHI_1 = α_1 • M and CHI_2 = α_2 • M, wherein α_1 and α_2
each is a weight value, and where CHI_1 and CHI_2 are the corresponding intermediate sound
signals for this second embodiment.

In the second embodiment, as opposed to the first embodiment, fewer input
sound signals (in the example one) than generated output sound signals (in the
example two) are processed by means of fewer HRTFs as compared to the prior art.

In step 200, a first (newly generated) set of Head Related Transfer Functions
may be determined. Said first set (of Head Related Transfer Functions) may be based on the
second set of sound signals, i.e. the input sound signals, the second set of Head Related
Transfer Functions (as discussed and used in the prior art) and the newly determined
weighted relation(s). In other words, said first, new set of Head Related Transfer Functions
is generated for the purpose of a subsequent transformation of the intermediate sound
signal(s) by means of it in the next step. The determination takes into account the second set
of sound signals, i.e. sound signals (typically inputs) such as CH_1, CH_2, ..., CH_n
and M, and said second set of Head Related Transfer Functions initially dedicated to
transform or transfer said second set of input sound signals. Further, the determination takes
said weighted relation (CH_i = α_i • L + β_i • R, etc.) with corresponding intermediate signals
(L, R, etc.) into account, corresponding to the formulas used to explain the invention's two
embodiments.

In step 300, at least one signal from said third set of intermediate sound signals
(L, R, CHI_1, CHI_2) may be transferred by means of at least one HRTF from said first set (of
newly generated Head Related Transfer Functions) in order to generate at least one signal (as
an output signal) belonging to said first set of output sound signals (H_PL, H_PR). At this
point, the newly generated HRTFs, i.e. said first set of Head Related Transfer Functions
(Σ(α_i • HRTF_i,L), Σ(β_i • HRTF_i,R), H_1, H_2, etc.), may be used to actually transfer and
transform (convolve) one or more intermediate sound signals, such as L and R (first
embodiment) or CHI_1 and CHI_2 (second embodiment). As a result, at least one of the output
sound signals H_PL, H_PR is then generated.

It is hereby an advantage of the invention that said generation will, as
previously discussed in the embodiments, generally be performed by fewer HRTFs and
convolutions than the prior art.
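
Steps 200 and 300 may be sketched in Python for an arbitrary number of intermediate
signals. This is a hedged illustration, not the implementation of the application: the
function names (determine_first_set, transfer), the weight-matrix layout and the sample
data are assumptions, the weights are taken as already determined in step 100, and
" • " is read as a time-domain convolution. The example uses four input signals
expressed through three intermediate signals (L, R and a third signal C), so the first
set holds six filters where the second set holds eight.

```python
def convolve(x, h):
    """Full linear convolution of two finite sample sequences."""
    y = [0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def determine_first_set(weights, second_set):
    """Step 200: combine the second set of HRTFs into the first set.

    weights[i][k]:    weight of intermediate signal k in input signal i.
    second_set[i][e]: impulse response from input signal i to output e.
    Returns first_set[k][e] = sum_i weights[i][k] * second_set[i][e].
    """
    n_inputs = len(weights)
    n_intermediate = len(weights[0])
    n_outputs = len(second_set[0])
    taps = len(second_set[0][0])
    return [[[sum(weights[i][k] * second_set[i][e][t] for i in range(n_inputs))
              for t in range(taps)]
             for e in range(n_outputs)]
            for k in range(n_intermediate)]


def transfer(intermediate, first_set):
    """Step 300: filter the intermediate signals with the first set."""
    outputs = []
    for e in range(len(first_set[0])):
        parts = [convolve(signal, first_set[k][e])
                 for k, signal in enumerate(intermediate)]
        outputs.append([sum(samples) for samples in zip(*parts)])
    return outputs


# Hypothetical data: intermediate signals L, R, C and four input signals.
intermediate = [[1, 0, 2], [0, 1, 1], [1, 1, 0]]
weights = [[1, 0, 0], [0, 1, 0], [1, 1, 1], [0, 2, 1]]      # from step 100
second_set = [[[1, 1], [1, 0]], [[2, 0], [0, 1]],
              [[0, 1], [1, 1]], [[1, 2], [2, 1]]]

first_set = determine_first_set(weights, second_set)
h_pl, h_pr = transfer(intermediate, first_set)
filter_count = len(first_set) * len(first_set[0])
```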

Usually, the method will start all over again as long as the media system is
powered. Otherwise, the method may terminate in step 400; however, when the media system
is powered again, etc., the method may proceed from step 100.

A computer readable medium may be magnetic tape, optical disc, digital 
versatile disk (DVD), compact disc (CD record-able or CD write-able), mini-disc, hard disk, 
floppy disk, smart card, PCMCIA card, etc. 
In the claims, any reference signs placed between parentheses shall not be
construed as limiting the claim. The word "comprising" does not exclude the presence of
elements or steps other than those listed in a claim. The word "a" or "an" preceding an
element does not exclude the presence of a plurality of such elements.

The invention can be implemented by means of hardware comprising several
distinct elements, and by means of a suitably programmed computer. In the device claim
enumerating several means, several of these means can be embodied by one and the same
item of hardware. The mere fact that certain measures are recited in mutually different
dependent claims does not indicate that a combination of these measures cannot be used to
advantage.
1. A method of generating, in a media system, at least one output signal from a
first set of sound signals from at least one input signal from a second set of sound signals
having a related second set of Head Related Transfer Functions, said method comprising the
steps of:

• determining, for each signal in the second set of sound signals, a weighted relation
comprising at least one signal from a third set of intermediate sound signals and at least
one weight value;

• determining a first set of Head Related Transfer Functions based on the second set of
sound signals, the second set of Head Related Transfer Functions and the weighted
relation; and

• transferring at least one signal from the third set of intermediate sound signals by means
of at least one HRTF from said first set of Head Related Transfer Functions in order to
generate at least one output signal belonging to said first set of sound signals.

2. A method according to claim 1, characterized in that the step of determining, for
each signal i in the second set of sound signals, determines CH_i = α_i • L + β_i • R, wherein
α_i and β_i each is the weight value, and wherein L and R each is a signal from said third
set of intermediate sound signals.

3. A method according to claim 1, characterized in that the step of determining for
each signal determines CHI_1 = α_1 • M and CHI_2 = α_2 • M, wherein α_1 and α_2 each is the
weight value, and wherein CHI_1 and CHI_2 each is a signal from said third set of intermediate
sound signals.

4. A method according to any one of claims 1 through 3, characterized in that the
media system is a TV, a CD player, a DVD player, a Radio, a display, an amplifier, a
headphone or a VCR.

5. A computer system for performing the method according to any one of claims
1 through 4.

6. A computer program product comprising program code means stored on a
computer readable medium for performing the method of any one of claims 1 through 4 when
the computer program is run on a computer.

7. A media system for generating at least one output signal from a first set of
sound signals from at least one input signal from a second set of sound signals having a
related second set of Head Related Transfer Functions, said media system comprising:

• means for determining, for each signal in the second set of sound signals, a weighted
relation comprising at least one signal from a third set of intermediate sound signals and
at least one weight value;

• means for determining a first set of Head Related Transfer Functions based on the second
set of sound signals, the second set of Head Related Transfer Functions and the weighted
relation; and

• means for transferring at least one signal from the third set of intermediate sound signals
by means of at least one HRTF from said first set of Head Related Transfer Functions in
order to generate at least one output signal belonging to said first set of sound signals.
ABSTRACT:

The present invention relates to a method and a media system for
generating at least one output signal (H_PL, H_PR) from at least one input signal from a
second set of sound signals (M) having a related second set of Head Related Transfer
Functions. The media system can be a TV, a CD player, a DVD player, a Radio, a display, an
amplifier, a headphone or a VCR. Said method includes the steps of determining, for each
signal in the second set of sound signals, a weighted relation (14) comprising at least one
signal from a third set of intermediate sound signals (CHI1, CHI2) and at least one weight
value (Weights); determining a first set of Head Related Transfer Functions (HRTFs) based
on the second set of sound signals, the second set of Head Related Transfer Functions and the
weighted relation; and transferring at least one signal from the third set of intermediate sound
signals by means of at least one HRTF from said first set of Head Related Transfer Functions
in order to generate at least one output signal belonging to said first set of sound signals.
Hereby, in the end, fewer HRTFs are determined for a subsequent transfer of input signal(s)
to output signal(s). Accordingly, fewer convolutions are required.
